Learning Simpler Language Models with the Differential State Framework

Authors

  • Alexander Ororbia
  • Tomas Mikolov
  • David Reitter
Abstract

Learning useful information across long time lags is a critical and difficult problem for temporal neural models in tasks such as language modeling. Existing architectures that address the issue are often complex and costly to train. The differential state framework (DSF) is a simple and high-performing design that unifies previously introduced gated neural models. DSF models maintain longer-term memory by learning to interpolate between a fast-changing data-driven representation and a slowly changing, implicitly stable state. Within the DSF framework, a new architecture is presented, the delta-RNN. This model requires hardly any more parameters than a classical, simple recurrent network. In language modeling at the word and character levels, the delta-RNN outperforms popular complex architectures, such as the long short-term memory (LSTM) and the gated recurrent unit (GRU), and, when regularized, performs comparably to several state-of-the-art baselines. At the subword level, the delta-RNN's performance is comparable to that of complex gated architectures.
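
The interpolation idea described in the abstract can be illustrated with a minimal sketch. This is not the delta-RNN's actual update rule, which the abstract does not give. The function name `interpolate_state` is hypothetical, and the gate logits are passed in directly as an assumption; in a trained model they would presumably be computed from the input using learned parameters.

```python
import math


def sigmoid(v):
    """Logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))


def interpolate_state(prev_state, candidate, gate_logits):
    """Mix a slowly changing state with a fast candidate, element-wise.

    Hypothetical helper for illustration only. Each gate value r lies
    in (0, 1): r near 1 keeps the previous state, r near 0 adopts the
    data-driven candidate.
    """
    new_state = []
    for h_prev, z, g in zip(prev_state, candidate, gate_logits):
        r = sigmoid(g)
        new_state.append(r * h_prev + (1.0 - r) * z)
    return new_state


# Usage: the first unit holds on to its old value, the second is overwritten.
prev = [1.0, 1.0]
cand = [0.0, 0.0]
state = interpolate_state(prev, cand, [10.0, -10.0])
midpoint = interpolate_state([1.0], [0.0], [0.0])
```

A gate close to 1 lets a unit carry information across many time steps, which is one way to read the abstract's "slowly changing, implicitly stable state".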

Journal:
  • Neural Computation

Volume 29, Issue 12

Pages: -

Publication date: 2017